VIRTEX - A German-Russian Translation Experiment

Authors

  • Bianka Buschbeck-Wolf
  • Renate Henschel
  • Iris Hoser
  • Gerda Klimonow
  • Andreas Kustner
  • Ingrid Starke
Abstract

Machine translation from German into Russian has not yet received the attention it deserves. Typical Russian phenomena such as verbal aspect especially complicate the translation process, and the known MT approaches do not deal with such problems satisfactorily. To close this gap, we started the German-Russian translation experiment VIRTEX. We focused on the crucial point: high-quality translation of verbal groups (VIRTEX = Verbs-into-Russian Translation Experiment). VIRTEX is thus designed to solve, in principle, the linguistic difficulties connected with verb translation:

  • recognition of German analytical tense forms, passive constructions, reflexive forms and verbs with detached prefixes
  • analysis of complex German verb groups (modal and modal-like constructions)
  • selection of Russian equivalents according to valency frames, including semantic features of verb complements
  • choice of the appropriate Russian aspect form
  • passive-active transformation where a Russian passive form is missing
  • adequate translation of modal constructions

The vocabulary, more than 200 sample verbs, was selected to represent these phenomena completely. Translating verbal groups adequately requires some context. We therefore accept as input simple sentences with verb complements and adverbials, which means the vocabulary also includes a set of nouns, pronouns, adverbs and some other words. The restricted syntax model is made for short main clauses; it excludes relative and other subordinate clauses, attributes, and infinitive clauses with complements. VIRTEX can be classified as an MT system between the second and third generation, with a syntactic interface structure supplemented by some semantic information. The system's architecture is the classical one: analysis, transfer and synthesis are separate routines processed sequentially, and a separate lexicon has been implemented for each of them.
The input work is supported by special tools, including automatic generation of the inflectional types. While the procedures are written in LISP, the dictionaries and their editors are implemented in dBASE. The format restrictions of dBASE are compensated for by a special management of the entries and a corresponding method of LISP access to them. VIRTEX runs on a PC-XT. Some steps of the system will be illustrated briefly.
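
To make the idea of selecting Russian equivalents by valency frame more concrete, here is a minimal sketch in Python. It is not the VIRTEX implementation (which is written in LISP over dBASE dictionaries); the names `Complement`, `TransferEntry`, `TRANSFER_LEXICON` and `select_equivalent`, the marker labels, and the semantic feature `"exam"` are all hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Complement:
    """One verb complement found by analysis (hypothetical representation)."""
    marker: str                          # case or preposition, e.g. "acc", "auf", "aus"
    features: frozenset = frozenset()    # semantic features of the complement


@dataclass(frozen=True)
class TransferEntry:
    """One hypothetical transfer-lexicon entry: a German verb frame and its Russian verb."""
    german: str
    marker: str           # complement marker the frame requires
    required: frozenset   # semantic features the complement must carry
    russian: str


# Illustrative entries for the German verb "bestehen" (assumed data, not from VIRTEX).
TRANSFER_LEXICON = [
    TransferEntry("bestehen", "auf", frozenset(), "настаивать"),          # insist on
    TransferEntry("bestehen", "aus", frozenset(), "состоять"),            # consist of
    TransferEntry("bestehen", "acc", frozenset({"exam"}), "выдержать"),   # pass (an exam)
]


def select_equivalent(verb, complements, lexicon=TRANSFER_LEXICON):
    """Return the Russian verb of the first entry whose frame matches a complement."""
    for entry in lexicon:
        if entry.german != verb:
            continue
        for comp in complements:
            # The frame matches when the marker agrees and all required
            # semantic features are present on the complement.
            if comp.marker == entry.marker and entry.required <= comp.features:
                return entry.russian
    return None  # no matching frame in the lexicon
```

For example, `select_equivalent("bestehen", [Complement("aus")])` picks the "consist of" reading, while an accusative complement without the assumed `"exam"` feature matches no entry and yields `None`. A real transfer lexicon would need full frames with several complements and a default reading.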

Similar resources

Limits Of A Sentence Based Procedural Approach For Aspect Choice In German-Russian MT

In this paper we discuss some problems arising in German-Russian Machine Translation with regard to tense and aspect. Since the formal category of aspect is missing in German, the information required for generating Russian aspect forms has to be extracted from different representation levels. A sentence-based procedure for aspect choice in the MT system VIRTEX is presented which takes lexical, ...

Building a German/Simple German Parallel Corpus for Automatic Text Simplification

In this paper we report our experiments in creating a parallel corpus using German/Simple German documents from the web. We require parallel data to build a statistical machine translation (SMT) system that translates from German into Simple German. Parallel data for SMT systems needs to be aligned at the sentence level. We applied an existing monolingual sentence alignment algorithm. We show t...

Automatic speech recognition and translation of a Swiss German dialect: Walliserdeutsch

Walliserdeutsch is a Swiss German dialect spoken in the south west of Switzerland. To investigate the potential of automatic speech processing of Walliserdeutsch, a small database was collected based mainly on broadcast news from a local radio station. Experiments suggest that automatic speech recognition is feasible: use of another (Swiss German) database shows that the small data size lends i...

UC Berkeley at CLEF 2003 - Russian Language Experiments and Domain-Specific Cross-Language Retrieval

As in the previous years, Berkeley’s group experimented with the domain-specific CLEF collection GIRT as well as with Russian as query and document language. The GIRT collection was substantially extended this year and we were able to improve our retrieval results for the query languages German, English and Russian. For the GIRT retrieval experiments, we utilized our previous experiences by c...

Publication date: 1990